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SUFFICIENCY  AND  THE  NUMBER  OF  LEVEL  CROSSINGS  BY  A STATIONARY  PROCESS 

Benjamin  Kedem 

University  of  Maryland,  College  Park 

Summary . It  is  shown  how  to  derive  the  exact  distribution  of  the  num- 
ber of  axis  crossings  by  a stationary  process  when  the  binary  process 
obtained  by  clipping  the  original  process  is  a pth-order  Markov  chain. 
The  same  method  is  used  in  deriving  the  asymptotic  distribution  of  the 
number  of  upcrossings  of  a high  level  by  a stationary  process. 

Key  words  and  phrases:  binary,  Markov  chain,  level  crossings,  symbol 
changes,  upcrossings,  high  level 


1.  Introduction . Let  Z^,  t=l,...,n,  be  a strictly  stationary  time 
series,  and  let  X^. , t=l,...,n,  be  the  binary  time  series  which  takes 
the  values  1 whenever  Z^.  i a and  0 otherwise.  X^  as  well  as  quantities 
defined  by  it  should  be  indexed  by  the  level  a,  but  except  in  one  case 
we  shall  avoid  this  indexing  for  the  sake  of  simplified  notation. 
Associated  with  are  the  statistics 

n n n n 


D(n)  = 2 7 X -2  T X X_  -(X,+X  ) and  D (n) 
t = l t t = 2 t t_1  1 n a 


l xx 

t = 2 


t-1* 


D(n)  counts  the  number  of  symbol  changes  in  the  binary  series  and 

hence  it  counts  the  number  of  crossings  of  level  a by  Z^.  When  X^  + 

X = 0,  D (n)  counts  the  number  of  upcrossings  of  level  a by  Z . We 
n d ^ 

shall  find  the  distribution  of  D(n),  n fixed,  for  level  a = 0 and  the 
asymptotic  distribution  of  D (n),  as  a,n  -+  00  in  a suitable  manner, 
when  X^  is  either  a first  or  second-order  Markov  chain.  The  same 
technique  applies  to  higher  order  chains. 

We  shall  make  use  of  the  results  in  Kedem  (1976,a).  Consequently 


we  define 
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p = Pr(Zt>a), 

X.  = P (Z  »a 
k r t 

k = 1 , 2 

p.  = P (Z^. 
r t 

>a|Zt_1>a,  Zt 

_2-a)  ’ 

s = zxt. 

R1  = ZXtVl’ 

R2  = ZXtXt-2> 

c = zxt 

Xt-lXt-2’  H =X1  + V 

u = X2  + 

Xn -1 ’ V = X1X2 

+ X ,X  . 
n -1  n 

For  a review  of  level  crossings  problems  and  an  extensive  biblio- 
graphy see  Leadbetter  (1972). 


2.  The  number  of  axis  crossings.  In  this  section  a = 0 and  p = 1/2. 

That  is  P (Z  >0)  = 1/2. 
r t 

Theorem  1.  If  X^.  is  a first-order  Markov  chain,  then  the  number  of 
axis  crossings  by  Z^  , t = l,...,n,  has  a binomial  distribution 
b(n-l,l-X  ) . 


Proof . The  probability  of  a 0-1  series  for  which  D(n)  = d is  given  by 

d/v—  v — \ — i ( y \ \d\(n  — 1)— d (1) 

P _ r\  (1“X_)  X 

r l 1 nn  i li 


and  there  are  2(n^''')  such  sequences.  Multiply  this  number  by  (1)  to 
obtain  the  desired  binomial  distribution. 


Observe  that  under  the  conditions  of  the  theorem  D(n)  is  minimal 

A 

sufficient  for  and  the  maximum  likelihood  estimate  of  is  = 

{ ( n -1 ) -D( n ) }/ ( n -1 ) while  VrK^-X)  is  asymptotically  N(0 ,X^( 1-X^  ) ) . 

Just  when  may  we  expect  the  above  binomial  distribution  to  be  a 
reasonable  approximation  to  the  actual  distribution  of  the  numDer  of 
axis  crossings?  So,  consider  a stationary  AR(1)  process  = 4>Z^_1  + 
ut»  | $|  < ut  are  independent  N(0,1)  variates.  For  each  of  19 
values  of  <j>  1000  time  series  of  size  5 were  generated.  The  size  was 
fixed  at  5 to  allow  the  expected  number  of  successes  in  each  cell  in 
a multinomial  experiment  to  exceed  1 in  1000  repetitions.  We  wish  to 
test  H^:  D(n)~b(n-1, 1-  X^)  where  now  n = 5 and  X.  = i-  + — sin  1 (♦  ) . The 
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results  of  the  19 


chi-square  goodness  of  fit  tests  are  summarize 


Table  1.  It  is  seen  that  the  results  are  very  satisfactory  for 
-0.5  8 8 <^  <}>  5 0.600  where  HQ  is  accepted  at  level  of  significance  0.01. 
This  example  indicates  that  the  above  binomial  distribution  is  rea- 
sonable when  neighboring  observations  in  the  Z^  series  are  at  most 


moderately  correlated. 

Table  1:  Observed  (expected)  frequencies  of  the  number  of  axis  cros- 
sings by  Zt=<}>Zt_i  + ut,  t = 1 , . . . , 5 , ut  N(  0 , 1)  , in  1000 
independent  realizations. 


X 

0 

1 

2 

3 

4 

*(4) 

. 809 

. 800 

457 

322 

171 

43 

7 

56 .235* 

(409.6) 

( 409 . 6 ) 

(153.6) 

(25.6) 

(1.6) 

. 707 

.750 

358 

366 

190 

73 

13 

50.716* 

(316.4) 

(421.87) 

(210 .94) 

(46.87) 

(3.91) 

. 600 

.705 

259 

405 

243 

80 

13 

6.505 

( 247  ) 

(413.4) 

( 260  ) 

(72.3) 

(7.6) 

.588 

.700 

253 

400 

252 

81 

14 

6.303 

(240.1) 

(411.6) 

( 264 . 6 ) 

(75.6) 

(8.1) 

.500 

.666 

209 

391 

290 

87 

23 

11 .549 

(197.4) 

(394.9) 

(296 .2) 

(98.72) 

(12.3) 

.400 

.631 

169 

365 

318 

119 

29 

7 . 387 

(158.5) 

(370.8) 

(325.2) 

( 126 . 8 ) 

(18.5) 

. 309 

.600 

143 

331 

336 

157 

33 

4.483 

(129.6) 

( 345.6  ) 

( 345.6  ) 

(153.6) 

(25.6) 

.250 

.580 

127 

315 

344 

173 

41 

5.783 

(113.1) 

(327.8) 

( 356) 

<N 

r-i 

(31.1) 

.100 

.532 

85 

281 

363 

215 

56 

1.908 

( 80) 

( 282  ) 

( 372  ) 

( 218) 

(48) 

.000 

. 500 

64 

254 

378 

233 

71 

2.436 

(62.5) 

( 250  ) 

( 375  ) 

( 250  ) 

(62.5) 

-.100 

.468 

53 

217 

366 

374 

90 

2.099 

(48) 

(218) 

( 372  ) 

( 282  ) 

(80) 

-.250 

.420 

37 

175 

345 

315 

128 

4.007 

( 31.1) 

(171.2) 

( 356  ) 

( 327.8) 

(113.1) 

-.309 

.400 

30 

160 

341 

326 

143 

3.580 

(25.6) 

(153.6) 

(345.6) 

(345.6) 

(129.6) 

-.400 

. 369 

22 

133 

335 

335 

175 

6.433 

(18.5) 

(126.8) 

(325.2) 

( 370 . 8 ) 

(158.5) 

- . 500 

. 333 

18 

113 

294 

365 

210 

7 .791 

(12.3) 

(98.72) 

( 296.2  ) 

( 394.9 ) 

(197.4) 

-.588 

. 300 

18 

97 

258 

358 

269 

28 . 779* 

(8.1) 

(75.6) 

(264.6) 

(411.6) 

(240.1) 

-.600 

.295 

18 

93 

259 

358 

272 

30.116* 

(7.6) 

(72.3) 

(260) 

(413.4) 

(247) 

-.707 

.250 

12 

71 

213 

343 

361 

50.268* 

(3.91) 

(46.87) 

(210.94) 

(421.87) 

(316.4) 

-.809 

.200 

10 

37 

157 

305 

491 

92.191* 

(1.6) 

(25.6) 

(153.6) 

(409.6) 

( 409.6) 

* Indicates  that  the  hypothesis  HQ  is  rejected  at  level  of  significance  0.01. 
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A more  realistic  assumption  is  that  X^.  displays  a higher  order 
dependence.  The  extension  of  Theorem  1 to  the  case  when  X^_  is  a 
k th-order  Markov  chain  is  somewhat  more  involved  but  straightforward. 
For  this  purpose  let  us  consider  the  second-order  case  in  detail;  the 
k th-order  case  follows  an  identical  argument. 

When  X^.  is  a second-order  chain  it  was  shown  in  Kedem  ( 1976, a) 
that  { S , , R2  , C , H , U , V } is  a set  of  sufficient  statistics  for  X^,X2>p, 

and  their  joint  distribution  is  given  there.  An  equivalent  but  a 
more  convenient  set  of  sufficient  statistics  is  {S , D( n ) , F , Z ' , H , U , V } 
where  D(n)  = 2S-2R^-H,  F = R^-C  is  the  number  of  1-runs  in  the  X^_ 
series  with  two  or  more  l's  and  Z'  = R2 -C  is  the  number  of  0-runs  be- 
tween the  first  and  last  1 with  exactly  one  0.  It  follows  that  the 
joint  distribution  of  the  last  set  can  be  obtained  from  that  of  the 
first  one.  We  have 

g(  s , d , f , z ' ,h  , u , v)  = Pr(S  = s,  D(n)=d,  F = f , Z ' = z ' , H = h,  U = u,  V = v) 

_1  d 

= N(s,d,f,z>  ,h,u,v)Kn(C1S2?3C4)S(52C3?4)  2 <.Z3ZH)~fZ*'  (2) 

1_ 

• [U253S4)  2c5]h5g?7, 

where  K ,£  are  functions  of  p = 1/2, X X ,p  and  are  given 

n i y 4 / 1 J t 

in  Kedem  (1976)  and 
N(  s ,d , f , z ' ,h  , u , v ) 

j 2 , /max(h,u)i  /i<d+h)-l\  / n-s-i<d-h)-2  X/|<d-h)\  /s-|<d+h)-l\ 

\max(h,u)/\  v l\  z'  I \ i-(  d -h ) -z ' -u+vl  I f-v  )\  f-1  / 

Theorem  2 . If  X^  is  a second  order  Markov  chain  then  the  distribution 
of  the  number  of  crossings  by  Z^ , t = l,...,n  is  given  by 

h + u+n  -4 

P (D(n)-d)  = l l l I g(  s , d , f , z * ,h,u,v) , (3) 

(h,u,v)  s = h+u  f = v z’=0 
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where  (h,u,v)  takes  values  in  ( 1 , 2 , 1 ) , ( 1 , 1 , 1 ) , ( 1 , 1 , 0 ) , ( 1 , 0 , 0 ) when 
d is  odd  and  in  ( 2 , 2 , 2 ) , ( 2 , 1 , 1 ) , ( 0 , 2 , 0 ) , ( 2 , 0 , 0 ) , ( 0 , 1 , 0 ) , ( 0 , 0 , 0 ) 
when  d is  even. 

In  principle  it  is  possible  to  extend  our  method  to  obtain  the 
distribution  of  D(n)  when  the  0-1  series  displays  a higher  order  de- 
pendence but  the  joint  distribution  of  the  sufficient  statistics  be- 
bomes  messier. 


3.  Upcrossings  of  a high  level.  In  this  section  we  shall  elicit  the 
Poisson  nature  of  the  upcrossings  of  a high  level  a by  Z^.,  by  using 
the  above  method  of  examining  the  joint  distribution  of  several  suf- 
ficient statistics.  The  Poisson  nature  of  these  upcross  ings  [ 3 ”]  has  been 
known  for  nearly  twenty  years  for  continuous  parameter  Gaussian  pro- 
cesses under  various  moment  conditions.  Z , however,  is  not  neces- 
sarily Gaussian. 


Theorem  3.  Assume  X^.  is  a first  order  Markov  chain.  If  a,n  -*■  * such 
that 

(i)  nPr(Zt  - a)  = a,  a remains  constant, 

(ii)  Pr(Zt  > a |zt_1  2 a)  = X^a)  \ , 


then 

-a( l-\. ) . 

e 1 [a(l-X, )] 

lim  P (D  ( n ) =k  ) = r-: = , k = 0,l,...  . (4) 

a-*-® 


Proof . A simple  combinatorial  argument  shows  that 


P (S=s,D  (n)  = k,H  = 0) 
r d 


k s-n+2, s-k 

P q x 


(i-xl) 


2k 


• ( l-2p+X1p ) 


n-l-s-k 


Replace  p by  a/n  and  q by  1-a/n  and  note  that  {H=0}  becomes  a sure 
event  as  a -*  ®.  Then 


v- 


J 
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lim  P (S=s,D  (n)=k) 

and  sum  over  s . 


-a( 1-X  ) 

[a(l-X1)]Ke  1 


As  consequences  we  have  firstly 

-a( 1 -X  ) 

lim  P ( max  Z < a ) = e 
a-*-“  r t = 1 , . . . ,n 


(5) 


(6) 


and  secondly,  the  asymptotic  distribution  of  S,  the  total  time  spent 
above  a high  level  a,  is  the  Polya-Aeppli  distribution  obtained  by 
summing  (5)  over  k,  with  mean  a and  variance  a( 1+X  )/( 1-X^) . 

Similar  results  can  be  obtained  for  the  second-order  case.  To 
simplify  matters  assume  z'  ->  0 as  a -*■  00  with  probability  one,  which 
happens  if  and  only  if  X2~X^|j.  -*-0,  a -*•  °° . 

Theorem  4.  If  is  a second-order  Markov  chain  such  that  (i)  and 
(ii)  above  hold  and 

( iii ) Pr(Zt>a|Zt_1>a,Zt_2>a)  = n(a)  -*■  |x , 

2 ' 

(iv)  (X2~X1p.)  -*■  1 with  probability  one,  as  a 

then  D (n)  has  an  asymptotic  Poisson  distribution  with  parameter 

cl 

ad-X^  . 


Proof.  From  (2)  with  p = a/n  and  the  fact  that  { H = 0 , U = 0,  V=0}  be- 
comes a sure  event,  it  follows  that 

k -a( 1-X 

lim  P ( S=s , D (n)=k,F=f)  = «-e 

F cl  K • 

a-*» 

.f  s-k-f,,  v2fn  •>  , r,  > ,k-f 

X1(i  (1-p.)  Ll-X1(2-p.)J 

But 


s-k-1! 


s=k+f\  f-1 


s-k-f  _ , . . -f 

H = (1-n) 


and 
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k 

l 


f = 0 


k 

f 


[X1(l-ti)]f[l-X1(2-ti)]k_f 


d-x1)k  , 


so  tha t P (D  (n)=k) 
r a 


-a(l-X1 

e 


[a(l-X1)]k/k!  . 


4 . Some  applications. 

When  parameters  of  interest  are  related  in  some  fashion  to  the 
number  of  axis  crossings,  Theorem  1 can  be  used  in  deriving 
appropriate  estimators  and  their  approximate  distributions.  We 
bring  two  such  cases. 


Estimation  in  AR(l).  Suppose  Z^_  = + ut  is  a stationary 

AR(1)  process  as  above,  and  suppose  it  is  clipped  at  level  D.  If 
the  clipped  process  X^_  approximates  a first  order  Markov  chain, 
then  the  maximum  likelihood  estimate  of  based  on  the  clipped  data 

is 


; , , ? , . /(n-1)  - (#  of  axis  crossings)  l\ 
* = *<V  = sin  — 2/ 

Experience  shows  [2]  that  this  estimator  behaves  remarkably  well 
even  when  | <#>  | is  close  to  1.  When  1 4>  | is  small  so  that  the 
binomial  approximation  to  the  distribution  of  the  number  of  axis 
crossings  is  adequate,  it  follows  directly  that 


Vr \(.<t>-<t>)  — N(  0 ,n2Xl(  1-X^  ) ) , n ~ . 


Estimation  of  the  mean  frequency.  Let  Z(t)  - " < t < ”,  be  a zero 
mean  stationary  Gaussian  process  with  correlation  function  p( t ) . 
Asaume  that  the  sample  functions  are  continuous  with  probability 
one  and  that  in  a sufficiently  small  time  interval,  say  A,  the 


8 


probability  that  Z(t)  has  more  than  one  0 is  negligible.  Con- 
sider the  interval  [0,t]  and  partition  it  into  (n-1)  subinter- 
vals of  size  A.  Then  (n-l)A  = T.  We  hold  T fixed  as  A -*■  0 

and  n -*•  " simultaneously.  Let  X^  ^ take  the  value  1 when 

Z((i-1)A)  2 0,  and  0 otherwise,  i = l,...,n,  and  let  D(n) 

be  the  number  of  symbol  changes  in  the  X^  ^ series.  If  D is  the 

true  number  of  axis  crossings  in  [0,T)  then  D(n)  -*■  D , n -*•  *(  A -*  0 ) 
a.s.  As  a first  approximation  to  the  distribution  of  D(n)  we  take 

D(ri)  ~ b(n-l,  l-\,  ).  Thus,  by  1'Hospital's  rule 

1 ,n 

E(D)  = lim  E ( D(n ) ) = lira  ^ L j i si~L(p(A))l  = “ y, 

n-*°°  A-*Q 

r = [-P"(o)]1/2 


provided  the  derivative  exists.  y is  called  the  mean  frequency. 
A reasonable  estimate  for  y is  then  [4] 

* n D(  n ) 


whose  approximate  distribution  is  easily  obtained  from  that  of  D(n). 
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